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GLP-1 ANALOG FUSION PROTEINS 

FIELD OF THE INVENTION 

The present invention relates to glucagon-like peptide analogs fused to proteins 

5 that have the effect of extending the in vivo half-life of the peptides. These fusion 

proteins can be used to treat diabetes as well as a variety of other conditions or disorders. 

Glucagon-like peptide- 1 (GLP-l) analogs and derivatives show promise in clinical 
trials for the treatment of type 2 diabetes. GLP-l induces numerous biological effects 
such as stimulating insulin secretion, inhibiting glucagon secretion, inhibiting gastric 

10 emptying, inhibiting gastric motility or intestinal motility, and inducing weight loss. A 
significant characteristic of GLP-l is its ability to stimulate insulin secretion without the 
associated risk of hypoglycemia that is seen when using insulin therapy or some types of 
oral therapies that act by increasing insulin expression. 

The usefulness of therapy involving GLP-l peptides has been limited by the fact 

15 that GLP-l(l-37) is poorly active, and the two naturally occurring truncated peptides, 
GLP-l(7-37)OH and GLP-1(7-36)NH 2 , are rapidly cleared in vivo and have extremely 
short in vivo half lives. It is known that endogenously produced dipeptidyl-peptidase IV 
(DPP-IV) inactivates circulating GLP-l peptides by removing the N-terminal histidine 
and alanine residues and is a major reason for the short in vivo half-life. 

20 Various approaches have been undertaken to extend the elimination half-lif e of a 

GLP-l peptide or reduce clearance of the peptide from the body while maintaining 
biological activity. One approach involves fusing a GLP-l peptide to the Fc portion of an 
immunoglobulin. Immunoglobulins typically have long circulating half-lives in vivo. For 
example, IgG molecules can have a half-life in humans of up to 23 days. The Fc portion 

25 of the immunoglobulin is responsible, in part, for this in vivo stability. GLP-l-Fc fusion 
proteins take advantage of the stability provided by the Fc portion of an immunoglobulin 
while preserving the biological activity of the GLP-l molecule. 

Although this approach is feasible for QLP-1 therapeutics (See WO 02/46227), 
there is a general concern regarding the antigenicity of various fusion proteins when 

30 administered repeatedly over prolonged periods of time. This is especially a concern for 
GLP-l -Fc fusion therapeutics as a patient with diabetes must be treated for her entire life 
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once diagnosed with the disease. In addition, Fc fusion protein therapeutics can be a 
concern if the Fc portion retains unwanted effector functions. 

The present invention seeks to overcome the problems associated with the 
potential immunogenicity and effector activity associated with administration of GLP-1- 
Fc fusions by identifying specific GLP-l-Fc fusion proteins that have a reduced risk of 
inducing an immune response after repeated and prolonged administration and no longer 
have effector function. These specific fusion proteins have substitutions at various 
positions in the GLP-1 portion as well as the Fc portion of the molecule. The 
substitutions described herein provide increased potency, increased in vivo stability, 
elimination of effector function and decreased likelihood the molecule will be recognized 
by the adaptive elements of the immune system. 

Compounds of the present invention include heterologous fusion proteins 
comprising a GLP-1 analog comprising a sequence selected from the group consisting of 

a) (SEQIDNO:l) 

His-Xaas-Glu-Gly-Thr-Phe-Thr^ 

Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-Gly-Gly-Gly 
wherein Xaag is selected from Gly and Val; 

b) (SEQIDNO:2) 

His-Xaag-Glu-Gly-Thr-Phe-Tlir-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Glu- 
Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Lys-Asn-Gly-Gly-Gly 
wherein Xaag is selected from Gly and Val; 

c) (SEQEDNO:3) 

ffis-Xaa 8 -Glu-Gly-Thr-Phe-T^ 

Gln-Ala-Ala-Lys-Glu-Phe-ne-Ala-Tip-Leu-Val-Lys-Gly-Gly-Pro 
wherein Xaa 8 is selected from Gly and Val; 

d) (SEQ ID NO:4) 

His-Xaag-Glu-Gly-Thr-Phe-Thr-Se^ 

Gin- Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Lys-Asn-Gly-Gly-Pro 
wherein Xaag is selected from Gly and Val; 

e) (SEQIDNO:5) 

ffis-XaarGlu-Gly-Thr-Phe-Thr-Ser-^ 
Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Tip-Leu-Val-Lys-Gly-Gly 
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wherein Xaag is selected from Gly and Val; 
f) (SEQ ID NO:6) 

ffisOCaas-Glu-Gly-Thr-Phe-Thr^ 

Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Tip-Leu-Lys-Asn-Gly-Gly 
wherein Xaag is selected from Gly and Val; 

fused to the Fc portion of an immunoglobulin comprising the sequence of SEQ ID 
NO:7 

Ala-Glu-Ser-Lys-Tyr-Gly-Pro-Pro-Cys-Pro-Pro-Cys-Pro-Ala-Pro- 

Xaai6-Xaai7-Xaai8-Gly-Gly-PrCHSer-Val-Phe-Leu-Phe-Pro-Pro-Lys-Pro- 

Lys-Asp-Thr-Leu-Met-ne-Ser-Arg-Thr-Pro-Glu-Val-Thr-Cys-Val- 

Val-Val-Asp-Val-Ser-Gln-Glu-Asp-PrcHGIu-Val-Gln-Phe-Asn-Ti^- 

Tyr-Val-Asp-Gly-Val-Glu-Val-ffis^ 

Glu-Glu-Gln-Phe-Xaa 80 -Ser-Thr-Tyr-Arg-V^ 

Val-Leu-His-Gln-Asp-Trp-Leu-Asn-Gly-Lys-Glu-Tyr-Lys-Cys-Lys- 

Val-Ser-Asn-Lys-Gly-L^u-Pro-Ser-Ser-ne-Glu-Lys-Thr-Ile-Ser- 

Lys-Ala-Lys-Gly-Gln-Pro-Arg-Glu-Pro-Gln-Vd-Tyr-Thr-Leu~Pro- 

Pro-Ser-Gln-Glu-Glu-Met-Thr-Lys-Asn-Gln-Val-Ser-Leu-Thr-Cys- 

l^u-Val-Lys-Gly-Phe-Tyr-Pro-Ser-Asp-Ile-Ala-Val-Glu-Trp-Glu- 

Ser-Asn-Gly-Gln-Pro-Glu-Asn-Asn-Tyr-Lys-Thr-Thr-Pro-Pro-Val- 

l^u-Asp-Ser-Asp-Gly-Ser-Phe-Phe-I^u-Tyr-Ser-Arg-l^u-Thr-Val- 

Asp-Lys-Ser-Arg-Trp-Gln-Glu-Gly-Asn-Val-Phe-Ser-Cys-Ser-Val- 

Met-His-Glu-Ala-L^-ms-Asn-His-Tyr-Thr-Gln-Lys-Ser-Lxu-Ser- 

Leu-Ser-Leu-Gly-Xaa23o (SEQ ID NO:7) 

wherein: 

Xaa at position 16 is Pro or GIu; 
Xaa at position 17 is Phe, Val, or Ala; 
Xaa at position 18 is Leu, Glu, or Ala; 
Xaa at position 80 is Asn or Ala; and 
Xaa at position 230 is Lys or is absent. 

The C-terminus of the GLP-1 analog portion and the N-terminus of the Fc portion 
of the heterologous fusion proteins of the present invention are preferably fused together 
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via 1, 1.5 or 2 repeats of a G-rich peptide linker having the sequence Gly-Gly-Gly-Gly- 
Ser-Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser (SEQ ID NO:8). 

The present invention also includes polynucleotides encoding the heterologous 
fusion proteins of the present invention, as well as vectors and host cells comprising such 
5 polynucleotides. Methods of treating patients suffering from non-insulin dependent as 
well as insulin dependent diabetes mellitus, obesity, and various other disorders and 
conditions comprising administering the heterologous fusion proteins discussed herein are 
also encompassed by the present invention. 

The heterologous fusion proteins of the present invention comprise a 
10 GLP-1 analog portion and an Fc portion. The GLP-1 analog portion and the Fc 
portion comprise substitutions to the native GLP-1 sequence and the human IgG4 
sequence respectively that provide the protein with increased potency and in vivo 
stability compared to native GLP-1 or GLP-1 analogs not fused to an Fc 
sequence while decreasing the potential for inducing antibody formation after 
15 prolonged and repeated administration in humans. 

Native GLP-1 is processed in vivo such that the first 6 amino acids are cleaved 
from the molecule. Thus, by custom in the art, the amino terminus of GLP-1 has been 
assigned the number 7 and the carboxy-terminus, number 37. The other amino acids in 
the polypeptide are numbered consecutively as shown in SEQ ID NO:9. For example, 
20 position 8 is alanine and position 22 is glycine. The processed peptide may be further 

modified in vivo such that the C-terminal glycine residue is removed and replaced with an 
amide group. Thus, GLP-l(7-37)OH and GLP-1 (7-36)amide represent the two native 
forms of the molecule. GLP-l(7-37)OH has the amino acid sequence of SEQ ID NO:9: 

25 7 ffis-Ala-Glu- 10 Gly-Thr-Phe-^ 

Gln-Ala-^Ala-Lys-Glu-Phe-fle- 3 ^ 
(SEQIDNO:9) 

The GLP-1 analog portion of the heterologous fusion protein comprises 
30 three primary substitutions at positions 8, 22, and 36 relative to native GLP-1(7- 
37). The substitution at position 8 reduces the rate at which the endogenous 
enzyme dipeptidyl-peptidase IV (DPP-IV) inactivates the analog. DPP-IV 
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cleaves native GLP-1 between the 2 nd and 3 rd amino acids (between position 8 
and 9) and the resulting molecule is less active. Thus, the heterologous fusion 
proteins of the present invention are DPP-IV resistant The substitution at 
position 22 reduces the potential of the molecule to aggregate and increases the 

5 potency of the molecule. The substitution at position 36 in the context of the 
analog with changes at 8 and 22 as well as in the context of the entire fusion 
protein reduces the risk that the fusion protein will induce a neutralizing immune 
response after repeated and prolonged administration in humans. 

The central event in the generation of both humoral and cell-mediated immune 

10 responses is the activation and clonal expansion of T-helper (Th) cells. T H cell activation 
is initiated by interaction of the T-cell receptor (TCR)-CD3 complex with a processed 
antigenic peptide bound to a class II major histocompatibility (MHC) molecule in the 
presence of an antigen-presenting cell (APC). Interaction of a T H cell with antigen 
initiates a cascade of biochemical events that induces the resting T H cell to enter the cell 

15 cycle (Go to Gi transition). The activated T cell progresses through the cell cycle, 
proliferating and differentiating into memory cells or effector cells. 

The following sequence was analyzed to identify potential epitopes: 
His-Gly-Glu-Gly-Thr-Phe-^ 
Ala-Lys-Glu-Phe-De-Ala-Tr^ 

20 Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser^ 

Gly-Gly-Gly-Ser-Ala-Glu-Ser-Lys-Tyr-Gly-Pro-Pro-Cys-Pro (SEQ ID NO:10) 

This sequence is a GLP-1 analog sequence with changes at positions 8 
and 22 relative to the native sequence followed by 2 copies of a G-rich linker 
sequence followed by the first 10 amino acids of an Fc region derived from 

25 human IgG4. Epitope as used herein refers to a region of a protein molecule to 
which an antibody can bind. An immunogenic epitope is defined as the part of 
the protein that elicits an antibody response when the whole protein is the 
immunogen. Epitope mapping involved the scanning of sequences using a 
sliding nine amino acid window coupled with advanced statistical analysis 

30 techniques to extract the information contained in these patterns. A proprietary 
software package known as EpiMatrix™ was used to analyze the sequence and 
identify peptides that are highly likely to provoke an immune response when 
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presented to T-cells. Eight highly common alleles were used in the analysis for 
Class II MHC receptor interaction. These alleles included DRB1*0101, 
DRB1*0301, DRB1*0401, DRB1*0701, DRB1*0801, DRB1*1101, 

DRBl*1301,andDRBl*1501. 

» 

5 A strong epitope was predicted to be located at the junction of the C- 

terminus of the GLP-l analog portion and the beginning of the linker. The 
sequence of this epitope is Trp-Leu-Val-Lys-Gly-Arg-Gly-Gly-Gly (SEQ ID 
NO:l 1) which interacts with DRB 1*0801. The present invention encompasses 
the discovery that this epitope can be eliminated by changing the GLP-1 analog 

10 C-terminus to one of the following sequences: Trp-Leu-Val-Lys-Gly-Gly-Gly 
(SEQ ID NO:12); Trp-Leu-Lys-Asn-Gly-Gly-Gly (SEQ ID NO:13); Trp-Leu- 
Val-Lys-Gly-Gly-Pro (SEQ ID NO: 14); Trp-Leu-Lys-Asn-Gly-Gly-Pro (SEQ ID 
NO: 15); Trp-Leu- Val-Lys-Gly-Gly (SEQ ID NO: 16); and Trp-Leu-Lys-Asn- 
Gly-Gly (SEQ ID NO: 17). 

15 The heterologous fusion proteins of the present invention contain an Fc 

portion which is derived from human IgG4, but comprises one or more 
substitutions compared to the wild-type human sequence. As used herein, the Fc 
portion of an immunoglobulin has the meaning commonly given to the term in 
the field of immunology. Specifically, this term refers to an antibody fragment 

20 which does not contain the two antigen binding regions (the Fab fragments) from 
the antibody. The Fc portion consists of the constant region of an antibody from 
both heavy chains, which associate through non-covalent interactions and 
disulfide bonds. The Fc portion can include the hinge regions and extend 
through the CH2 and CH3 domains to the c-terminus of the antibody. The Fc 

25 portion can further include one or more glycosylation sites. 

There are five types of human immunoglobulins with different effector 
functions and pharmcokinetic properties. IgG is the most stable of the five types 
having a serum half-life in humans of about 23 days. There are four IgG 
subclasses (Gl, G2, G3, and G4) each of which have different biological 

30 functions known as effector functions. These effector functions are generally 
mediated through interaction with the Fc receptor (FcyR) or by binding Clq and 
fixing complement. Binding to FcyR can lead to antibody dependent cell 
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mediated cytolysis, whereas binding to complement factors can lead to 
complement mediated cell lysis. In designing heterologous Fc fusion proteins 
wherein the Fc portion is being utilized solely for its ability to extend half-life, it 
is important to minimize any effector function. Thus, the heterologous fusion 
5 proteins of the present invention are derived from the human IgG4 Fc region 
because of its reduced ability to bind FcyR and complement factors compared to 
other IgG sub-types. IgG4, however, has been shown to deplete target cells in 
humans [Issacs et aL, (1996) Clin. Exp. Immunol. 106:427-433]. Because the 
heterologous fusion proteins of the present invention target beta cells in the 

10 pancreas to induce insulin expression, using an IgG4 derived region in an Fc 

fusion protein could initiate an immune response against the pancreateic beta cell 
through interaction of the fusion protein with the GLP-1 receptor present on 
pancreatic beta cells. Thus, the IgG4 Fc region which is part of the heterologous 
fusion proteins of the present invention contains substitutions that eliminate 

15 effector function. The IgG4 Fc portion of the fusion proteins of the present 

invention may contain one or more of the following substitutions: substitution Of 
proline for glutamate at residue 233, alanine or valine for phenylalanine at 
residue 234 and alanine or glutamate for leucine at residue 235 (EU numbering, 
Kabat, E.A. et al. (1991) Sequences of Proteins of Immunological Interest, 5 th Ed. 

20 U.S. Dept. of Health and Human Services, Bethesda, MD, NIH Publication no. 
91-3242). These residues corresponds to positions 16, 17 and 18 in SEQ ID 
NO:7. Further, removing the N-linked glycosylation site in the IgG4 Fc region 
by substituting Ala for Asn at residue 297 (EU numbering) which corresponds to 
position 80 of SEQ ID NO:7 is another way to ensure that residual effector 

25 activity is eliminated in the context of a heterologous fusion protein. 

In addition, the IgG4 Fc portion of the heterologous fusion proteins of the 
present invention contain a substitution that stabilizes heavy chain dimer 
formation and prevents the formation of half-IgG4 Fc chains. The heterologous 
fusion proteins of the present invention preferably exist as dimers joined together 

30 by disulfide bonds and various non-covalent interactions. Wild-type IgG4 

contains a Pro-Pro-Cys-Pro-Ser-Cys (SEQ ID NO: 18) motif beginning at residue 
224 (EU numbering). This motif in a single GLP-1 analog-Fc chain forms 
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disulfide bonds with the corresponding motif in another GLP-1 analog-Fc chain. 
However, the presence of serine in the motif causes the formation of single chain 
fusion proteins. The present invention encompasses heterologous Pc fusion 
proteins wherein the IgG4 sequence is further modified such that serine at 

5 position at 228 (EU numbering) is substituted with proline (amino acid residue 
llinSEQIDNO:7). 

The C-terminal lysine residue present in the native molecule may be 
deleted in the IgG4 derivative Fc portion of the heterologous fusion proteins 
discussed herein (position 230 of SEQ ID NO:7; deleted lysine referred to as des- 

10 K). Fusion proteins expressed in some cell types (such as NS0 cells) wherein 
lysine is encoded by the C-terminal codon are heterogeneous in that a portion of 
the molecules have lysine as the C-terminal amino acid and a portion have lysine 
deleted. The deletion is due to protease action during expression in some types 
of mammalian cells. Thus, to avoid this heterogeneity, it is preferred that Fc 

15 fusion expression constructs lack a C-terminal codon for lysine. 

It is preferred that the C-terminal amino acid of the GLP-1 analog portion 
discussed herein is fused to the N-terminus of the IgG4 Fc analog portion via a 
glycine-rich linker. The in vivo function and stability of the heterologous fusion 
proteins of the present invention can be optimized by adding small peptide 

20 linkers to prevent potentially unwanted domain interactions. Further, a glycine- 
rich linker provides some structural flexibility such that the GLP-l analog 
portion can interact productively with the GLP-1 receptor on target cells such as 
the beta cells of the pancreas. These linkers, however, can significantly increase 
the risk that the fusion protein will be immunogenic in vivo. Thus, it is preferred 

25 that the length be no longer than necessary to prevent unwanted domain 

interactions and/or optimize biological activity and/or stability. The preferred 
glycine-rich linker comprises the sequence: Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly- 
Gly-Ser-Gly-Gly-Gly-Gly-Ser (SEQ ID NO:8). Although morfe copies of this 
linker may be used in the heterologous fusion proteins of the present invention, it 

30 is preferred that a single copy of this linker be used to minimize the risk of 
immunogenicity associated with prolonged and repeated administration. 



WO 2005/000892 PCT/US2004/0 15595 

-9- 

Preferred GLP-l-Fc heterologous fusion proteins of the present invention include 
the following proteins: Gly s -Glu 22 -Gly 36 -GlP-l(7-37)-lHgG4 (S228P), Gly^Glu 22 - 
Gly 36 -GLP-l(7-37)-lL-IgG4 (S228P, F234A, L235A), Gly 8 -Glu 22 -Gly 36 -GLP- 1(7-37)- 
lL-IgG4 (S228P, N297A), Gly 8 -Glu 22 -Gly 36 rGLP-l(7-37>lL-IgG4 (S228P, F234A, 

5 L235A, N297A), Gly 8 -Glu 22 -Gly 36 -GIJ>.l(7-37)-1.5HgG4 (S228P), Gly^Glu^-Gly 36 - 
GLP-l(7-37)-1.5L-IgG4 (S228P, F234A, L235A), Gly 3 ^51u 22 -<Jly ,6 -GLP-l(7-37)-1.5Lr 
IgG4 (S228P, N297A), Gly 8 -Glu 22 -Gly 36 -GLP-l(7-37)-L5L-IgG4 (S228P, F234A, 
L235A, N297A), Gly 8 -Glu 22 -Gly 36 -GLP-l(7-37)-2L-IgG4 (S228P), Gly^Glu^-Gly 36 - 
GLP-l(7-37)-2L-IgG4 (S228P, F234A, L235A), Gly 8 -Glu 22 -Gly 36 -GLP-l(7-37)-2L4gG4 

10 (S228P, N297A), and Gly 8 -Glu 22 -Gly 36 -GUP-l(7-37)-2L-IgG4 (S228P, F234A, L235A, 
N297A), and the Val 8 and des-K forms of all of the above. 

The nomenclature used herein to refer to specific heterologous fusion proteins is 
defined as follows: Specific substitutions to the GLP-1 portion of the fusion protein are 
indicated using the specific amino acid being substituted followed by the residue number. 

15 GLP-1 (7-37) indicates that the GLP-1 portion of the mature fusion protein begins with 
His at position 7 and ends with Gly at position 37. L refers to a linker with the sequence 
Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser (SEQ ID NO:8). The 
number immediately preceding the L refers to the number of linkers separating the GLP-1 
portion from the Fc portion. A linker specified as 1.5L refers to the sequence Gly-Ser- 

20 Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser 
(SEQ ID NO: 19) IgG4 refers to an analog of the human IgG4 Fc sequence specified as 
SEQ ID NO:7. Substitutions in the IgG4 Fc portion of the heterologous fusion protein are 
indicated in parenthesis. The wild-type amino acid is specified by its common 
abbreviation followed by the position number in the context of the entire IgG4 sequence 

25 using the EU numbering system followed by the amino acid being substituted at that 
position specified by its common abbreviation. 

Although the heterologous fusion proteins of the present invention can be made by 
a variety of different methods, because of the size of the fusion protein, recombinant 
methods are preferred. For purposes of the present invention, as disclosed and claimed 

30 herein, the following general molecular biology terms and abbreviations are defined 
below. 
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"Base pair" or "bp" as used herein refers to DNA or RNA. The abbreviations 
A,C,G, and T correspond to the 5-monophosphate forms of the deoxyribonucleosides 
(deoxy)adenosine, (deoxy)cytidine, (deoxy)guanosine, and thymidine, respectively, when 
they occur in DNA molecules. The abbreviations U,C,G, and A correspond to the 5- 
5 monophosphate forms of the ribonucleosides uridine, cytidine, guanosine, and adenosine, 
respectively when they occur in RNA molecules. In double stranded DNA, base pair may 
refer to a partnership of A with T or C with G. In a DNA/RNA, heteroduplex base pan- 
may refer to a partnership of A with U or C with G. (See the definition of 
"complementary", infra .) 

10 "Digestion" or "Restriction" of DNA refers to the catalytic cleavage of the DNA 

with a restriction enzyme that acts only at certain sequences in the DNA ("sequence- 
specific endonucleases"). The various restriction enzymes used herein are commercially 
available and their reaction conditions, cofactors, and other requirements were used as 
would be known to one of ordinary skill in the art. Appropriate buffers and substrate 

15 amounts for particular restriction enzymes are specified by the manufacturer or can be 
readily found in the literature. 

"Ligation" refers to the process of forming phosphodiester bonds between two 
double stranded nucleic acid fragments. Unless otherwise provided, ligation may be 
accomplished using known buffers and conditions with a DNA ligase, such as T4 DNA 

20 ligase. 

"Plasmid" refers to an extrachromosomal (usually) self-replicating genetic 
element. 

"Recombinant DNA cloning vector" as used herein refers to any autonomously 
replicating agent, including, but not limited to, plasmids and phages, comprising a DNA 
25 molecule to which one or more additional DNA segments can or have been added. 

"Recombinant DNA expression vector" as used herein refers to any recombinant 
DNA cloning vector in which a promoter to control transcription of the inserted DNA has 
been incorporated. 

"Transcription" refers to the process whereby information contained in a 
30 nucleotide sequence of DNA is transferred to a complementary RNA sequence. 

"Transfection" refers to the uptake of an expression vector by a host cell whether 
or not any coding sequences are, in fact, expressed. Numerous methods of transfection 
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are known to the ordinarily skilled artisan, for example, calcium phosphate co- 
precipitation, liposome transfection, and electroporation. Successful transfection is 
generally recognized when any indication of the operation of this vector occurs within the 
host cell. 

5 'Transformation" refers to the introduction of DNA into an organism so that the 

DNA is replicable, either as an extrachromosomal element or by chromosomal 
integration. Methods of transforming bacterial and eukaryotic hosts are well known in 
the art, many of which methods, such as nuclear injection, protoplast fusion or by calcium 
treatment using calcium chloride are summarized in J. Sambrook, et aL, Molecular 

10 Cloning: A Laboratory Manual, (1989). Generally, when introducing DNA into Yeast the 
term transformation is used as opposed to the term transfection. 

'Translation" as used herein refers to the process whereby the genetic information 
of messenger RNA (mRNA) is used to specify and direct the synthesis of a polypeptide 
chain. 

15 "Vector" refers to a nucleic acid compound used for the transfection and/or 

transformation of cells in gene manipulation bearing polynucleotide sequences 
corresponding to appropriate protein molecules which, when combined with appropriate 
control sequences, confers specific properties on the host cell to be transfected and/or 
transformed. Plasmids, viruses, and bacteriophage are suitable vectors. Artificial vectors 

20 are constructed by cutting and joining DNA molecules from different sources using 

restriction enzymes and ligases. The term "vector" as used herein includes Recombinant 
DNA cloning vectors and Recombinant DNA expression vectors. 

"Complementary" or "Complementarity", as used herein, refers to pairs of bases 
(purines and pyrimidines) that associate through hydrogen bonding in a double stranded 

25 nucleic acid. The following base pairs are complementary: guanine and cytosine; 
adenine and thymine; and adenine and uracil. 

"Primer" refers to a nucleic acid fragment which functions as an initiating 
substrate for enzymatic or synthetic elongation. 

"Promoter" refers to a DNA sequence which directs transcription of DNA to 

30 RNA. 

"Probe" refers to a nucleic acid compound or a fragment, thereof, which 
hybridizes with another nucleic acid compound. 
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"Leader sequence" refers to a sequence of amino acids which can be 
enzymatically or chemically removed to produce the desired polypeptide of interest 

"Secretion signal sequence" refers to a sequence of amino acids generally present 
at the N-terminal region of a larger polypeptide functioning to initiate association of that 

5 polypeptide with the cell membrane compartments like endoplasmic reticulum and 
secretion of that polypeptide through the plasma membrane. 

Wild-type human IgG4 proteins can be obtained from a variety of sources. For 
example, these proteins can be obtained from a cDNA library prepared from cells which 
express the mRNA of interest at a detectable level. Libraries can be screened with probes 

10 designed using the published DNA or protein sequence for the particular protein of 
interest. For example, immunoglobulin light or heavy chain constant regions are 
described in Adams, et al. (1980) Biochemistry 19:2711-2719; Goughet, et al. (1980) 
Biochemistry 19:2702-2710; Dolby, et al. (1980) Proc. Natl. Acad. Sci. USA 77:6027- 
6031; Rice et al. (1982) Proc. Natl. Acad. Sci. USA 79:7862-7862; Falkner, et al (1982) 

15 Nature 298:286-288; and Morrison, et al. (1984) Ann. Rev. Immunol. 2:239-256. 

Screening a cDNA or genomic library with the selected probe may be conducted 
using standard procedures, such as described in Sambrook et al., Molecular Cloning: A 
Laboratory Manual Cold Spring Harbor Laboratory Press, NY (1989). An alternative 
means to isolate a gene encoding an immunoglobulin protein is to use PCR methodology 

20 [Sambrook et al., supra; Dieffenbach et al., PCR Primer: A Laboratory Manual, Cold 
Spring Harbor Laboratory Press, NY (1995)]. PCR primers can be designed based on 
published sequences. 

Generally the full-length wild-type sequences cloned from a particular library can 
serve as a template to create the IgG4 Fc analog fragments of the present invention that 

25 retain the ability to confer a longer plasma half-life on the GLP-1 analog that is part of the 
fusion protein. The IgG4 Fc analog fragments can be generated using PCR techniques 
with primers designed to hybridize to sequences corresponding to the desired ends of the 
fragment PCR primers can also be designed to create restriction enzyme sites to 
facilitate cloning into expression vectors. 

30 DNA encoding the GLP- 1 analogs of the present invention can be made by a 

variety of different methods including cloning methods like those described above as well 
as chemically synthesized DNA. Chemical synthesis may be attractive given the short 
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length of the encoded peptide. The amino acid sequence for GLP-1 has been published as 
well as the sequence of the preproglucagon gene. [Lopez, et al. (1983) Proc. Natl. Acad. 
Sci., USA 80:5485-5489; Bell, etal. (1983) Nature, 302:716-718; Heinrich, G., etal. 
(1984) Endocrinol, 115:2176-2181; Ghiglione, M., et al. 91984) Diabetologia 27:599- 

5 600]. Thus, primers can be designed based on die native sequence to generate DNA 
encoding the GLP-1 analogs described herein. 

The gene encoding a fusion protein can then be constructed by ligating DNA 
encoding a GLP-1 analog in-frame to DNA encoding the IgG Fc proteins described 
herein. The DNA encoding wild-type GLP-1 and IgG4 Fc fragments can be mutated 

10 either before ligation or in the context of a cDNA encoding an entire fusion protein. A 
variety of mutagenesis techniques are well known in the art. The gene encoding the GLP- 
1 analog and the gene encoding the IgG4 Fc analog protein can also be joined in-frame 
via DNA encoding a G-rich linker peptide. A preferred DNA sequence encoding one of 
the preferred heterologous fusion proteins of the present invention, Gly^Glu^-Gly 36 - 

15 GLP-l(7-37)-lL-IgG4 (S228P, F234A, L235A, des K), is provided as SEQ ID NO:20: 
CACGGCGAGGGCACCTTCACCTCCGACGTGTCCTCCTATCTCGAGGAGCAGG 
CCGCCAAGGAATTCATCGCCTGGCTGGTGAAGGGCGGCGGCGGTGGTGGTGG 
CTCCGGAGGCGGCGGCTCTGGTGGCGGTGGCAGCGCTGAGTCCAAATATGGT 
CCCCCATGCCCACCCTGCCCAGCACCTGAGGCCGCCGGGGGACCATCAGTCTT 

20 CCTGTTCCCCCCAAAACCCAAGGACACTCTCATGATCTCCCGGACCCCTGAGG 
TCACGTGCGTGGTGGTGGACGTGAGCCAGGAAGACCCCGAGGTCCAGTTCAA 
CTGGTACGTGGATGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAG 
GAGCAGTTCAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCA 
GGACTGGCTGAACGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGGCCTC 

25 CCGTCCTCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAGC 
CACAGGTGTACACCCTGCCCCCATCCCAGGAGGAGATGACCAAGAACCAGGT 
CAGCCTGACCTGCCTGGTCAAAGGCTTCTACCCCAGCGACATCGCCGTGGAGT 
GGGAAAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCT 
GGACTCCGACGGCTCCTTCTTCCTCTACAGCAGGCTAACCGTGGACAAGAGC 

30 AGGTGGCAGGAGGGGAATGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGC 
ACAACCACTACACACAGAAGAGCCTCTCCCTGTCTCTGGGT (SEQ ID NO:20) 
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Host cells are transfected or transformed with expression or cloning vectors described 
herein for heterologous fusion protein production and cultured in conventional nutrient media 
modified as appropriate for inducing promoters, selecting transformants, or amplifying the 
genes encoding the desired sequences. The culture conditions, such as media, temperature, 
5 pH and the like, can be selected by the skilled artisan without undue experimentation. In 
general, principles, protocols, and practical techniques for maximizing the productivity of 
cell cultures can be found in Mammalian Cell Biotechnology: A Practical Approach, M. 
Butler, ed. (IRL Press, 1991) and Sambrook, etal., supra. Methods of transfection are 
known to the ordinarily skilled artisan, for example, CaP0 4 and electroporation. General 

10 aspects of mammalian cell host system transformations have been described in U.S. Patent 
No. 4,399,216. Transformations into yeast are typically carried out according to the method 
of van Solingen et d.JBact. 130(2): 946-7 (1977) and Hsiao et al., Proc. Natl. Acad Set 
USA 76(8): 3829-33 (1979). However, other methods for introducing DNA into cells, such 
as by nuclear microinjection, electroporation, bacterial protoplast fusion with intact cells, or 

15 polycations, e.g., polybrene or polyomithine, may also be used. For various techniques for 
transforming mammalian cells, see Keown, et al., Methods in Enzymology 185: 527-37 
(1990) and Mansour, et al., Nature 336(6197): 348-52 (1988). 

Suitable host cells for cloning or expressing the nucleic acid (e.g., DNA) in the 
vectors herein include yeast or higher eukaryote cells. 

20 Eukaryotic microbes such as filamentous fungi or yeast are suitable cloning or 

expression hosts for fusion protein vectors. Saccharomyces cerevisiae is a commonly used 
lower eukaryotic host microorganism. Others include Schizosaccharomyces pombe [Beach 
and Nurse, Nature 290: 140-3 (1981); EP 139,383 published 2 May 1995]; Muyveromyces 
hosts [U.S. Patent No. 4,943,529; Fleer, et al., Bio/Technology 9(10): 968-75 (1991)] such as, 

25 e.g., K lactis (MW98-8C, CBS683, CBS4574) [de Louvencourt et al., J. BacterioL 154(2): 
737-42 (1983)]; K. fiagilis (ATCC 12,424), K. bulgaricus (ATCC 16,045), K wickeramii 
(ATCC 24,178), K waltii (ATCC 56,500), K. drosophilarum (ATCC 36.906) [Van den Berg 
et al., Bio/Technology 8(2): 135-9 (1990)]; K. thermotoierans, and K. marxianus; yarrowia 
(EP 402,226); Pichia pastoris (EP 183,070) [Sreekrishna et al., J. Basic Microbiol 28(4): 

30 265-78 (1988)]; Candid; Trichoderma reesia (EP 244,234); Neurospora crassa [Case, et al., 
Proc. Natl AcadSci. USA 76(10): 5259-63 (1979)]; Schwanniomyces such as 
Schwanniomyces occidentulis (EP 394,538 published 31 October 1990); and filamentous 
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fungisuch as, e.g., Neurospora, Penicillium, Tolypocladium (WO 91/00357 published 10 
January 1991), and Aspergillus hosts such as A. nidulans [Ballance et al„ Biochem. Biophys. 
Res. Comm. 112(1): 284-9 (1983)]; Tflbum, etal., Gene 26(2-3): 205-21 (1983); Yelton, et 
aU Proc. Natl. Acad. Sci. USA 81(5): 1470-4 (1984)] and A. niger [Kelly and Hynes, EMBO 

5 J. 4(2): 475-9 (1985)]. Methylotropic yeasts are selected from the genera consisting of 

Hansenula, Candida, Kloeckera, Pichia, Saccharomyces, Torulopsis, and Rhodotoruia. A list 
of specific species that are exemplary of this class of yeast may be found in C. Antony, The 
Biocliemistry ofMethylotrophs 269 (1982). 

Suitable host cells for the expression of the fusion proteins of the present invention 

10 are derived from multicellular organisms. Examples of invertebrate cells include insect cells 
such as Drosophila S2 and Spodoptera Sp, Spodoptera highS as well as plant cells. Examples 
of useful mammalian host cell lines include NSO myeloma cells, Chinese hamster ovary 
(CHO), SP2, and COS cells. More specific examples include monkey kidney CV1 line 
transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney line [293 or 293 

15 cells subcloned for growth in suspension culture, Graham, et al. 9 J. Gen Virol, 36(1): 59-74 
(1977)]; Chinese hamster ovary cells/-DHFR [CHO, Urlaub and Chasin, Proc. Natl. Acad. 
Sci. USA, 77(7): 4216-20 (1980)]; mouse Sertoli cells [TM4, Mather, Biol Reprod. 
23(l):243-52 (1980)]; human lung cells (W138. ATCC CCL 75); human liver cells (Hep G2, 
HB 8065); and mouse mammary tumor (MMT 060562, ATCC CCL51). A preferred celLline 

20 for production of the Fc fusion proteins of the present invention is the NSO myeloma cell line 
available from the European Collection of Cell Cultures (ECACC, catalog #85110503) and 
described in Galfre, G. and Milstein, C ((1981) Methods in Enzymology 73(13):3-46; and 
Preparation of Monoclonal Antibodies: Strategies and Procedures, Academic Press, N.Y., 
N.Y.). 

25 The fusion proteins of the present invention may be recombinantly produced directly, 

or as a protein having a signal sequence or other additional sequences which create a specific 
cleavage site at the N-terminus of the mature fusion protein. In general, the signal sequence 
may be a component of the vector, or it may be a part of the fusion protein-encoding DNA 
that is inserted into the vector. For yeast secretion the signal sequence may be, e.g., the yeast 

30 invertase leader, alpha factor leader (including Saccharomyces and Kluyveromyces cc-factor 
leaders, the latter described in U.S. Patent No. 5,010,182), or acid phosphatase leader, the C. 
albicans glucoamylase leader (EP 362,179), or the signal described in WO 90/13646. In 
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mammalian cell expression, mammalian signal sequences may be used to direct secretion of 
the protein, such as signal sequences from secreted polypeptides of the same or related 
species as well as viral secretory leaders. 

Both expression and cloning vectors contain a nucleic acid sequence that enables the 

5 vector to replicate in one or more selected host cells. Expression and cloning vectors will 
typically contain a selection gene, also termed a selectable marker. Typical selection genes 
encode proteins that (a) confer resistance to antibiotics or other toxins, e.g., neomycin, 
methotrexate, or tetracycline, (b) complement autotrophic deficiencies, or (c) supply critical 
nutrients not available from complex media, e.g., the gene encoding D-alanine racemase for 

10 Bacilli. 

An example of suitable selectable markers for mammalian cells are those that enable 
the identification of cells competent to take up the fusion protein-encoding nucleic acid, such 
as DHFR or thymidine kinase. An appropriate host cell when wild-type DHFR is employed 
is the CHO cell line deficient in DHFR activity, prepared and propagated as described 

15 [Urlaub and Chasin, Proc. Natl Acad ScL USA, 77(7): 4216-20 (1980)]. A suitable 
selection gene for use in yeast is the trpl gene present in the yeast plasmid YRp7 
[Stinchcomb, et aL, Nature 282(5734): 39-43 (1979); Kingsman, et aL, Gene 7(2): 141-52 
(1979); Tschumper, et aL, Gene 10(2): 157-66 (1980)]. The trpl gene provides a selection 
marker for a mutant strain of yeast lacking the ability to grow in tryptophan, for example, 

20 ATCC No. 44076 or PEPC1 [Jones, Genetics 85: 23-33 (1977)]. 

Expression and cloning vectors usually contain a promoter operably linked to the 
fusion protein-encoding nucleic acid sequence to direct mRNA synthesis. Promoters 
recognized by a variety of potential host cells are well known. Examples of suitable 
promoting sequences for use with yeast hosts include the promoters for 3-phosphoglycerate 

25 kinase [Hitzeman, et aL, J. Biol Chem. 255(24): 12073-80 (1980)] or other glycolytic 
enzymes [Hess et aL, J. Adv. Enzyme Reg. 7: 149 (1968); Holland, Biocliemistry 17(23): 
4900-7 (1978)], such as enolase, glyceraldehyde-3-phosphate dehydrogenase, hexokinase, 
pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3- 
phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose 

30 isomerase, and glucokinase. Other yeast promoters, which are inducible promoters having 
the additional advantage of transcription controlled by growth conditions, are the promoter 
regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradative 
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enzymes associated with nitrogen metabolism, metallothionein, glyceraldehyde-3-phosphate 
dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable 
vectors and promoters for use in yeast expression are further described in EP 73,657. 
Transcription of fusion protein-encoding mRNA from vectors in mammalian host cells may 

5 be controlled, for example, by promoters obtained from the genomes of viruses such as 
polyoma virus, fowlpox virus, adenovirus (such as Adenovirus 2), bovine papilloma virus, 
avian sarcoma virus, cytomegalovirus, a retrovirus, hepatitis-B virus and Simian Virus 40 
(SV40), from heterologous mammalian promoters, e.g., the actin promoter or an 
immunoglobulin promoter, and from heat-shock promoters, provided such promoters are 

10 compatible with the host cell systems. 

Transcription of a polynucleotide encoding a fusion protein by higher eukaryotes may 
be increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting 
elements of DNA, usually about from 10 to 300 bp, that act on a promoter to increase its 
transcription. Many enhancer sequences are now known from mammalian genes (globin, 

15 elastase, albumin, a-ketoprotein, and insulin). Typically, however, one will use an enhancer 
from a eukaryotic cell virus. Examples include the SV40 enhancer on the late side of the 
replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma 
enhancer on the late side of the replication origin, and adenovirus enhancers. The enhancer 
may be spliced into the vector at a position 5' or 3' to the fusion protein coding sequence but 

20 is preferably located at a site 5' from the promoter. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, 
human, or nucleated cells from other multicellular organisms) will also contain sequences 
necessary for the termination of transcription and for stabilizing the mRNA. Such sequences 
are commonly available from the 5' and occasionally 3' untranslated regions of eukaryotic or 

25 viral DNAs or cDNAs. These regions contain nucleotide segments transcribed as 

polyadenylated fragments in the untranslated portion of the mRNA encoding the fusion 
protein. 

Various forms of a fusion protein may be recovered from culture medium or from 
host cell lysates. If membrane-bound, it can be released from the membrane using a suitable 
30 detergent solution (e.g., Triton-X 100) or by enzymatic cleavage. Cells employed in 

expression of a fusion protein can be disrupted by various physical or chemical means, such 
as freeze- thaw cycling, sonication, mechanical disruption, or cell lysing agents. 
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Once the heterologous fusion proteins of the present invention are expressed in the 
appropriate host cell, the analogs can be isolated and purified. The following procedures are 
exemplary of suitable purification procedures: fractionation on carboxymethyl cellulose; gel 
filtration such as Sephadex G-75; anion exchange resin such as DEAE or Mono-Q; cation 
5 exchange such as CM or Mono-S; metal chelating columns to bind epitope-tagged forms of 
the polypeptide; reversed-phase HPLC; chromatofocusing; silica gel; ethanol precipitation; 
and ammonium sulfate precipitation. 

Various methods of protein purification may be employed and such methods are 
known in the art and described, for example, in Deutscher, Metliods in Enzymology 182: 

10 83-9 (1990) and Scopes, Protein Purification: Principles and Practice, Springer- Verlag, 
NY (1982). The purification step(s) selected will depend on the nature of the production 
process used and the particular fusion protein produced. For example, fusion proteins 
comprising an Fc fragment can be effectively purified using a Protein A or Protein G 
affinity matrix. Low or high pH buffers can be used to elute the fusion protein from the 

15 affinity matrix. Mild elution conditions will aid in preventing irreversible denaturation of 
the fusion protein. 

The heterologous fusion proteins of the present invention may be formulated with one 
or more excipients. The fusion proteins of the present invention may be combined with a 
pharmaceutically acceptable buffer, and the pH adjusted to provide acceptable stability, and a 

20 pH acceptable for administration such as parenteral administration. Optionally, one or more 
pharmaceutically-acceptable anti-microbial agents may be added. Meta-cresol and phenol 
are preferred pharmaceuticallyracceptable microbial agents. One or more pharmaceutically- 
acceptable salts may be added to adjust the ionic strength or tonicity. One or more excipients 
may be added to further adjust the isotonicity of the formulation. Glycerin is an example of 

25 an isotonicity-adjusting excipient. Pharmaceutically acceptable means suitable for 
administration to a human or other animal and thus, does not contain toxic elements or 
undesirable contaminants and does not interfere with the activity of the active compounds 
therein. 

The heterologous fusion proteins of the present invention may be formulated as a 
30 solution formulation or as a lyophilized powder that can be reconstituted with an appropriate 
diluent. A lyophilized dosage form is one in which the fusion protein is stable, with or 
without buffering capacity to maintain the pH of the solution over the intended in-use shelf- 
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life of the reconstituted product It is preferable that the solution comprising the 
heterologous fusion proteins discussed herein before lyphilization be substantially isotonic to 
enable formation of isotonic solutions after reconstitution. 

A phannaceutically-acceptable salt form of the heterologous fusion proteins of the 

5 present invention are within the scope of the invention. Acids commonly employed to form 
acid addition salts are inorganic acids such as hydrochloric acid, hydrobromic acid, hydriodic 
acid, sulfuric acid, phosphoric acid, and the like, and organic acids such as ^toluenesulfonic 
acid, methanesulfonic acid, oxalic acid, p-bromophenyl-sulfonic acid, carbonic acid, succinic 
acid, citric acid, benzoic acid, acetic acid, and the like. Preferred acid addition salts are those 

10 formed with mineral acids such as hydrochloric acid and hydrobromic acid. 

Base addition salts include those derived from inorganic bases, such as 
ammonium or alkali or alkaline earth metal hydroxides, carbonates, bicarbonates, and the 
like. Such bases useful in preparing the salts of this invention thus include sodium 
hydroxide, potassium hydroxide, ammonium hydroxide, potassium carbonate, and the 

15 like. 

The heterologous fusion proteins of the present invention have biological activity. 
Biological activity refers to the ability of the fusion protein to bind to and activate the GLP-1 
receptor in vivo and elicit a response. Responses include, but are not limited to, secretion of 
insulin, suppression of glucagon, inhibition of appetite, weight loss, induction of satiety, 

20 inhibition of apoptosis, induction of pancreatic beta cell proliferation, and differentiation of 
pancreatic beta cells. A representative number of GLP-1 fusion proteins were tested for in 
vitro as well as in vivo activity. Examples 1 and 2 provide in vitro activity based on the 
ability of the fusion protein to interact with and activate the human GLP-1 receptor. In both 
sets of experiments, HEK293 cells over-expressing the human GLP-1 receptor were used. 

25 Activation of the GLP-1 receptor in these cells causes adenylyl cyclase activation which in 
turn induces expression of a reporter gene driven by a cyclic AMP response element (CRE). 
Example 1 (table 1) provides data wherein the reporter gene is beta lactamase, and example 2 
(table 2) provides data wherein the reporter gene is luciferase. Example 3 provides data 
generated after administration of one of the heterologous fusion proteins of the present 

30 invention to rats. Together the data show that the fusion proteins are able to bind to and 
activate the GLP-1 receptor and appear more potent in vitro than Val 8 -GLP-l(7-37)OH. In 
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addition, the data generated in rats indicate the fusion proteins are active in vivo and have a 

longer half-life than native GLP-1. 

Administration of the heterogeneous fusion proteins may be via any route known to 

be effective by the physician of ordinary skill. Peripheral parenteral is one such method. 
5 Parenteral administration is commonly understood in the medical literature as the injection of 

a dosage form into the body by a sterile syringe or some other mechanical device such as an 

infusion pump. Peripheral parenteral routes can include intravenous, intramuscular, 

subcutaneous, and intraperitoneal routes of administration. 

The heterologous fusion proteins of the present invention may also be amenable to 
10 administration by oral, rectal, nasal, or lower respiratory routes, which are non-parenteral 

routes. Of these non-parenteral routes, the lower respiratory route and the oral route are 

preferred. 

The fusion proteins of the present invention can be used to treat a wide variety of 
diseases and conditions. The fusion proteins of the present invention primarily exert their 
15 biological effects by acting at a receptor referred to as the "GLP-1 receptor." Subjects with 
diseases and/or conditions that respond favorably to GLP-1 receptor stimulation or to the 
administration of GLP-1 compounds can therefore be treated with the GLP-1 fusion proteins 
of the present invention. These subjects are said to "be in need of treatment with GLP-1 
compounds" or "in need of GLP-1 receptor stimulation". Included are subjects with non- 
20 insulin dependent diabetes, insulin dependent diabetes, stroke (see WO 00/16797), 

myocardial infarction (see WO 98/08531), obesity (see WO 98/19698), catabolic changes 
after surgery (see U.S. Patent No. 6,006,753), functional dyspepsia and irritable bowel 
syndrome (see WO 99/64060). Also included are subjects requiring prophylactic treatment 
with a GLP-1 compound, e.g., subjects at risk for developing non-insulin dependent diabetes 
25 (see WO 00/07617). Subjects with impaired glucose tolerance or impaired fasting glucose, 
subjects whose body weight is about 25% above normal body weight for the subject's height 
and body build, subjects with a partial pancreatectomy, subjects having one or more parents 
with non-insulin dependent diabetes, subjects who have had gestational diabetes and subjects 
who have had acute or chronic pancreatitis are at risk for developing non-insulin dependent 
30 diabetes. 

An effective amount of the GLP-1 -Fc fusion proteins described herein is the 
quantity which results in a desired therapeutic and/or prophylactic effect without causing 
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unacceptable side-effects when administered to a subject in need of GLP-1 receptor 
stimulation. A "desired therapeutic effect" includes one or more of the following: 1) an 
amelioration of the symptom(s) associated with the disease or condition; 2) a delay in the 
onset of symptoms associated with the disease or condition; 3) increased longevity 

5 compared with the absence of the treatment; and 4) greater quality of life compared with 
the absence of the treatment. For example, an "effective amount" of a GLP-l-Fc fusion 
protein for the treatment of diabetes is the quantity that would result in greater control of 
blood glucose concentration than in the absence of treatment, thereby resulting in a delay 
in the onset of diabetic complications such as retinopathy, neuropathy or kidney disease. 

10 An "effective amount" of a GLP-1 -Fc fusion protein for the prevention of diabetes is the 
quantity that would delay, compared with the absence of treatment, the onset of elevated 
blood glucose levels that require treatment with anti-hypoglycaernic drugs such as 
sulfonyl ureas, thiazolidinediones, insulin and/or bisguanidines. 

The dose of fusion protein effective to normalize a patient's blood glucose will 

15 depend on a number of factors, among which are included, without limitation, the subject's 
sex, weight and age, the severity of inability to regulate blood glucose, the route of 
administration and bioavailability, the pharmacokinetic profile of the fusion protein, the 
potency, and the formulation. Doses may be in the range of 0.01 to 1 mg/kg body weight, 
preferably in the range of 0.05 to 0.5 mg/kg body weight. 

20 It is preferable that the fusion proteins of the present invention be administered 

either once every two weeks or once a week. Depending on the disease being treated, it 
may be necessary to administer the fusion protein more frequently such as two to three 
time per week. 

The present invention will now be described only by way of non-limiting example 
25 with reference to the following Examples. 

EXAMPLES 

Example 1 - In vitro GLP-1 receptor activation assay 
30 HEK-293 cells expressing the human GLP-1 receptor, using a CRE-BLAM 

system, are seeded at 20,000 to 40,000 cells/well/100 jxl DMEM medium with 10%FBS 
into a poly-d-lysine coated 96 well black, clear-bottom plate. The day after seeding, the 
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medium is flicked off and 80 |il plasma-free DMEM medium is added. On the third day 
after seeding, 20 |il of plasma-free DMEM medium with 0.5% BSA containing different 
concentrations of various GLP-l-Fc heterologous fusion protein is added to each well to 
generate a dose response curve. Generally, fourteen dilutions containing from 3 

5 nanomolar to 30 nanomolar or heterologous GLP-1 Fc fusion protein are used to generate 
a dose response curve from which EC50 values can be determined. After 5 hours of 
incubation with the fusion protein, 20 pi of P-lactanaase substrate (CCF2/AM, PanVera 
LLC) is added and incubation continued for 1 hour at which time fluorescence is 
determined on a cytofluor. The assay is further described in Zlokamik, et al. (1998), 

10 Science, 278:84-88. Various GLP-1 -Fc fusion proteins are tested and EC50 values are 
represented in Table 1. The values are relative to values determined for Val 8 -GLP-1(7- 
37)OH which is run as an internal control with every experiment 

Table 1 

15 Compound Activity Std. Dev. 
Val 8 -GLP-1: 100% 
Gly 8 -Glu 22 -GLP-l(7-37)-2L-IgG4 (S228P, F234A, L235A): 301 % 99 
Gly 8 -Glu 22 -GLP-l(7-37)-1.5L-IgG4 (S228P, F234A, L235A): 314% 45 
Gly 8 -Glu 22 -GLP-l(7-37)-lL-IgG4 (S228P, F234A, L235A): 468% 120 

20 Gly 8 -Glu 22 -Gl^ 6 -GU>-l(7-37)-2L-IgG4(S228P,F234A,L235A): 441% 35 

Example 2 - In vitro GLP-1 receptor activation assay 

HEK-293 cells stably expressing the human GLP-1 receptor, using a CRE- 
Luciferase system, are seeded at 30,000 cells/well/80 pi low serum DMEM F12 medium 

25 into 96 well plates. The day after seeding, 20 |jl aliquots of test protein dissolved in 0.5% 
BSA are mixed and incubated with the cells for 5 hours. Generally 12 dilutions 
containing from 3 pM to 3 nM are prepared at a 5X concentration for each test protein 
before addition to the cells to generate a dose response curve from which EC50 values are 
determined. After incubation, 100 |nl of Luciferase reagent is added directly to each plate 

30 and mixed gently for 2 minutes. Plates are placed in a Tri-lux luminometer and light 
output resulting from luciferase expression is calculated. Various GLP-l-Fc fusion 
proteins are tested and EC50 values are represented in Table 2. The values are relative to 
values determined for Val 8 -GLP-l(7-37)OH which is run as an internal control with every 
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experiment. Because the fusion proteins tested below are dimers, values are corrected 
taking into account a 2-fold difference in molarity. 

Table 2 

5 

Compound Activity Std. Dev. 

VaI 8 -GLP-l: 100% 
Gly 8 -Glu 22 -GLP- l(7-37)-2L-IgG4 (S228P, F234A, L235A): 535% 240 
Gly 8 -Glu 22 -GUP~l(7-37)-1.5HgG4 (S228P, E234A, L235A): 595% 43 
10 Gly 8 -Glu 22 -GLP-l(7-37>lL-IgG4 (S228P, F234A, L235A): 1119% 128 
Gly 8 -Glu 22 -Gly 36 -GLP-l(7-37>2L-IgG4 (S228P, F234A, L235A): 398% 62 
Gly 8 -Glu 22 ~Gly 36 -GLP-l(7-37)-lL-IgG4 (S228P, F234A, L235A): 417% 140 

Example 3 Intravenous Glucose Tolerance Test in Rats 
15 The Fc fusion protein, Gly 8 -Glu 22 -Gly 36 -GLP-l(7-37)-l^IgG4 

(S228P,F234A,L235A), is evaluated in an intravenous glucose tolerance test (IVGTT) in 
rats. At least four rats are included into each of three groups. Group I receives vehicle 
(table 3), Group II receives 1.79 mg/kg of Gly 8 -Glu 22 -Gly 36 -GIP-l(7-37)-L-IgG4 
(S228PJF234A,L235A) as a single subcutaneous injection (table 4), and Group III 
20 receives 0.179 mg/kg of Gly 8 -Glu 22 -Gly 36 -GLP-l(7-37)-L-IgG4 (S228P,F234A,L235A) 
as a single subcutaneous injection (table 5). Rats are subcutaneously injected the morning 
of Day 1. Twenty-four hours following the first injection, 1 [iL of glucose (D50) per 
gram rat body weight is infused as a bolus. Blood samples are taken at 2, 4, 6, 10, 20, and 
30 minutes following the bolus infusion of glucose. 
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Vehicle: 


Rati 


Rat 2 


Rat 3 


Rat 4 


Rat5 




Insulin AUC (ng+min/mL) 










Averaqe 


0-2 


11 


9.4 


7 


11 


9.6 


2-4 


18.1 


9.7 


5.6 


10.6 


8.8 


4-6 


13.4 


7 


3.4 


9.6 


5.9 


6-10 


7.9 


3.5 


2.5 


6 


2.9 


10-20 


3.7 


3 


2.4 


3 


2.4 


20-30 


2 


0 


0 


0 


2.4 


sum 


56.1 


32.6 


20.9 


40.2 


32 


36.4 


Table 4 














GLP-1-Fc 














(1.79mg/kg) 


Rati 


Rat 2 


Rat 3 


Rat 4 


Rat 5 




Insulin AUC (ng*min/mL) 










Averaoe 


0-2 


12.3 


17.4 


16 


14 


13 




2-4 


21.9 


13.3 


13.2 


13.9 


13.6 




4-6 


16.8 


6.5 


9.8 


11.1 


11.7 




6-10 


7.6 


3.8 


9.2 


5.8 


7.4 




10-20 


3 


0 


0 


3.2 


5.6 




20-30 


0 


0 


0 


0 


0 




sum 


61.6 


41 


48.2 


48 


51.3 




50 


Table5 














GLP-1-Fc 














(0.179mg/kg) 


Rati 


Rat 2 


Rat 3 


Rat 4 






Insulin AUC (ng*min/mL) 








Average 


SEM 


0-2 


14.4 


29.2 


25.4 


23.2 








2-4 


13.8 


26.3 


21.2 


21.8 








4-6 


11.2 


19.4 


16.4 


15.7 








6-10 


6.4 


10.6 


10.5 


8 








10-20 


3.6 


5.8 


5.2 


5 








20-30 


0 


0 


0 


0 








sum 


49.4 


91.3 


78.7 


73.7 




78.7 





SEM 



5.8 



SEM 



3.4 



8.7 
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Example 4 Pharmacokinetic Study Following a Single Subcutaneous I n jection to 
Cvnomolgus Monkevs. 

A study is performed to characterize the pharmacokinetics (PK) of the Fc fusion 
protein, Gly 8 -Glu 22 -Gly 36 -GlP-l(7-37)-Hgq4 (S228P,F234A,L235A), when 

5 administered as a 0. 1 mg/kg by subcutaneous (SC) injection to male cynomolgus 
monkeys. RIA antibody is specific for the middle portion of GLP. ELISA uses an N- 
terminus specific capture antibody and an Fc specific detection antibody. 
Resulting plasma concentrations from both the ELISA and the RIA are used to determine 
the represented pharmacokinetic parameter values. 

10 A representation of the resulting PK parameter values is summarized in table 6. 

Single-dose SC PK from the RIA is associated with a mean Qnax of 446.7 ng/mL with a 
corresponding T max of 17.3 hours. The mean elimination half-life is approximately 79.3 
hours (3.3 days). The PK from the ELISA is associated with a mean of 292.2 ng/mL 
with a corresponding T^x of 16.7 hours. The mean elimination half-life is approximately 

15 51.6 hours (2.2 days). 

Table 6 



RIA 


Dose 
(mg/kg) 


Animal # 


C a 

'-'max 

(ng/mL) 


rp b 

linax 

(h) 


AUCW 
(ng*h/mL) 


t I/2 d 
(h) 


CIVF* 
(mlVh/kg) 


Vss/F" 
(mL/kg) 


0.1 


96051 


461.0 


4.0 


37770.5 


81.0 


2.7 


309.2 




96071 


430.0 


24.0 


43150.2 


74.2 


2.3 


248.1 




96091 


449.0 


24.0 


62271.1 


82.9 


1.6 


191.9 


RIA 


Mean 


446.7 


17.3 


47730.6 


79.3 


2.2 


249.8 




SD 


15.6 


11.5 


12876.5 


4.5 


0.5 


58.7 


ELISA 




96051 


315.4 


2.0 


9062.3 


55.2 


11.0 


879.4 




96071 


289.4 


24.0 


16653.0 


50.3 


6.0 


436.0 




96091 


271.9 


24.0 


19907.4 


493 


5.0 


357.0 


ELISA 


Mean 


292.2 


16.7 


15207.6 


51.6 


7.3 


557.5 




SD 


21.9 


12.7 


5565.2 


3.2 


3.2 


281.6 



a Maximum observed plasma concentration. 
b Time of maximum observed plasma concentration. 



20 c Area under the plasma concentration-time curve measured from 0 to infinity. 

d Elimination half-life. 

e Total body clearance as a function of bioavailability. 
f Volume of distribution as a function of bioavailability. 
SD = Standard deviation. 



25 
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Example 5 Assessment of the potential formation of antibodies following repeat 
subcutanesous injections. 

Designated serum samples from cynomolgus monkeys are tested for the formation 
of antibodies against Gly 8 -Glu 22 -Gly 36 -GLP-l(7-37)-L-IgG4 (S228PJF234A t L235A) 

5 using a direct ELISA format . Microtiter plates are coated with Gly^Glu^-Gly^-GLP- 
l(7-37)-L-IgG4 (S228P,F234A,L235A) a t a 0.1 |Xg/mL concentration. Monkey serum 
samples are diluted 50, 500,1000 and 5000 fold into blocking solution, and 0.05 mL 
sample/well are incubated approximately one hour. Secondary antibody, Goat <Human 
Fab'2>-Peroxidase (with 75% cross reactivity to human), is diluted 10,000 fold into block 

10 and added at 0.05 mL/well and incubated approximately one hour. Color development 
using tetramethylbenzidine (TMB) substrate is read at an optical density of 450nm - 
630nm. Duplicate readings are averaged. A GLP-1 antibody was used as a positive 
control and goat<rabbit>(Hn-L)-Peroxidase conjugate is the secondary used for detection. 
Point serum samples are collected prior to dosing, at 24 hours following the second dose, 

15 and 168 hours following the first and second SC dose for an evaluation of potential 

immunogenicity. The presence of antibody titers to G8E22-CEX-L-bIgG4 is interpreted 
by comparison to predose serum samples and positive control. A representation of the 
results is presented in table 7. 



Table 7 



Dosel 
Animal# 


Positive 
Control 


107774 




107777 




107779 




IO7780 




Sample 
Time: 




Predose 


168 b 


Predose 


168 h 


Predose 


168 h 


Predose 


168 h 


50x 


2.854 


0268 


0268 


0.160 


0.128 


0.144 


0.152 


0264 


0224 


500x 


2270 


0.117 


0.133 


0.052 


0.069 


0.065 


0.061 


0.067 


0.061 


lOOOx 


1.610 


0.091 


0.075 


0.034 


0.051 


0.047 


0.045 


0.138 


0.049 


5000x 


0.525 


0.056 


0.048 


0.032 


0.037 


0.029 


0.033 


. 0.051 


0.039 






















Dose 2 
Animal# 


Positive 
Control 


107774 




IOT777 




107779 




IO7780 




Sample 
Time: 




Predose 


24 h 


Predose 


24 h 


Predose 


24b 


Predose 


24b 


50x 


3.056 


0.298 


0231 


0.164 


0.159 


0227 


0.176 


0211 


0.192 


500x 


2247 


0.120 


0.119 


0.048 


0.045 


0.061 


0.060 


0.056 


0.057 


lOOOx 


1.673 


0.090 


0.086 


0.039 


0.041 


0.046 


0.045 


0.043 


0.048 


5000x 


0.534 


0.039 


0.042 


0.030 


0.034 


0.033 


0.036 


0.033 


0.034 






















Dose 2 
Animal# 


Positive 
Control 


IOT774 




107777 




107779 




IO7780 




Sample 
Time: 




Predose 


168 h 


Predose 


168 b 


Predose 


168 b 


Predose 


168 b 


50% 


3.075 


0.413 


0270 


0.174 


0.182 


0.185 


0.190 


0224 


0.191 


SQOx 


2.173 


0.097 


0.103 


0.042 


0.051 


0.056 


0.057 


0.048 


0.053 


lOOOx 


1.510 


0.066 


0.067 


0.038 


0.040 


0.037 


0.046 


0.043 


0.043 


5OO0X 


0.474 


0.042 


0.042 


0.033 


0.046 


0.033 


0.033 


0.036 


0.041 



20 
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Examnle 6 Pharmacodynamic Study Following a Single Subcutaneously Injection to 
Cvnomolgus Monkeys.in the Fasting State and During a Graded Intravenous Glucose 
Infusion. 

In Phase 1 (Study Day 1) a subcutaneous injection of vehicle is administered. A 
5 graded intravenous glucose (20% dextrose) infusion of 5, 10, and 25 mg/kg/min is then 
administered immediately after the vehicle injection. In Phase 2 (Study Day 3), a 
subcutaneous injection of a GLP-1 fusion protein (0.1 mg/kg) is administered. In Phase 
3, a graded intravenous glucose infusion is performed approximately 96 hours following 
the GLP-1 fusion injection. 
10 Graded intravenous glucose infusion procedures are conducted in sedated 

monkeys after a 16-hr overnight fast. For both intravenous glucose infusions, baseline 
samples will be drawn every 10 min for 20 min to define baseline. A stepped-up glucose 
infusion is initiated at +20 min at a rate of 5 mg/kg/min, followed by infusions of 10 
mg/kg/min, and 25 mg/kg/min. Each infusion rate is administered for a period of 20 
15 minutes. Blood samples are taken at 10 minute intervals for measurement of glucose, 

insulin, and glucagon. Approximately 1.0 mL of blood is collected at -20, -10 min, 0 pre- 
glucose infusions, and at 10, 20, 30, 40, 50, and 60 minutes following glucose infusion 
for Phases 1 and 3. 

A representation of the data are shown in table 8. 

20 
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Table 8 



Glucose AUC 








AUC 






AUC 


Group 


Animal 


(min*mg/dL) 


Group 


Animal 


(min*mg/dt_) 


GLP-Fc . 


9423 


7447 


vehicle 


9423 


8077 




9424 


7470 




9424 


15006 




9510 


5153 




9510 


7116 




9513 


6303 




9513 


7459 




9516 


5413 




9516 


8728 




9530 


5240 




9530 


7863 










N 


6 




Mean 


6171 




Mean 


9041 




SD 


1078 




SD 


2973 




SE 


440 




SE 


1214 


Insulin AUC 








AUC 






AUC 


Group 


Animal 


(min*ng/mL) 


Group 


Animal 


(min*ng/mL) 


GLP-Fc 


9423 


129 


vehicle 


9423 


38 




9424 


138 




9424 


29 




9510 


357 




9510 


69 




9513 


161 




9513 


64 




9516 


376 




9516 


38 




9530 


215 




9530 


68 




Mean 


229 




Mean 


51 




SD 


111 




SD 


18 




SE 


45 




SE 


7 



Glucagon levels were not statistically different between the vehicle and the GLP-1 fusion 



protein dosed monkeys. 

5 

Example 7 Pharmacodynamic Study Following Single Subcutaneouslv Injections of Three 
Different Doses to Rats in the Fasting State and During a Graded Intravenous Glucose 
Infusion. 

Chronically cannulated rats are assigned to either vehicle control (saline) or one of 
10 3 treatment groups (GLP-1 fusion protein; 0.0179 mg/kg, 0.179 mg/kg, or 1.79 mg/kg). 
The GLP-1 fusion protein and vehicle are administered via subcutaneous injection. 
Twenty-four hours after treatment, overnight fasted (16h) rats are subjected to a graded 
intravenous glucose infusion test. The graded glucose infusion test consists of a baseline 
saline infusion period (20 min), followed by two 30 min glucose infusion phases at 5 and 
15 15 mg/kg/min, respectively. Plasma samples are collected at -20, -10 min, 0 pre-glucose 
infusions (baseline), and at 10, 20, 30, 40, 50, and 60 minutes. 
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A representation of the data are shown in table 9. 



Table 9 





5mgfKg/min 


15mg/Kg/min 


Vehicle 


4.3 ±0.2 (n=18) 


12,7 ±0.9 (n=18) 


0.0179 mg/Kg 


5.6±0.4(n=4) 


15.9 ± 1.8 (n=4) 


0.179 mg/Kg 


9.0±l.l*(n=6) 


28.0 ±3.8* (n=6) 


1.79rog/Kg 


20.5 ± 3.0 * (n=4) 


52.7 ±7.2* (n=4) 



*P < 0.05 versus vehicle 
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We Claim: 

1 . A heterologous fusion protein comprising a GLP-1 analog comprising a sequence 
selected from the group consisting of: 

a) (SEQ!DNO:l) 

His-Xaa 8 -Glu-Gly-TTir-Phe-^ 

Gln-Ala-Ala-Lys-Glu-Phe-ne-Ala-Trp-Leu-Val-Lys-Gly-Gly-Gly 
wherein Xaag is selected from Gly and Val; 

b) (SEQIDNO:2) 

His-Xaas-Glu-Gly-Thr-Phe-T^ 

Gln-Ala-Ala-Lys-Glu-Phe-fle-Ala-Trp-Leu-Lys-Asn-Gly-Gly-Gly 
wherein Xaag is selected from Gly and Val; 

c) (SEQ ID NO:3) 

His-Xaag-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Ixu-Glu-GIu^ 
Gln-Ala~Ala-Lys-Glu-Phe-ne-Ala-Tip-Leu-Val-Lys-Gly-Gly-Pro 
wherein Xaa 8 is selected from Gly and Val; 

d) (SEQIDNO:4) 

His-Xaa 8 -Glu-Gly-Thr-Phe-Thr~Ser-A^ 

Gin- Ala-Ala-Lys-Glu-Phe-De-Ala-Tip-Leu-Lys-Asn-Gly-Gly-Pro 
wherein Xaa 8 is selected from Gly and Val; 

e) (SEQIDNO:5) 

ffis-Xaas-Glu-Gly-Thr-Phe-Tto 

Gln-Ala-Ala-Lys-Glu-Phe-ne-Ala-Trp-Leu-Val-Lys-Gly-Gly 
wherein Xaag is selected from Gly and Val; 

f) (SEQIDNO:6) 

ffis-Xaas-Glu-Gly-Thr-Phe-Thr-Se^^ 
Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Lys-Asn-Gly-Gly 
wherein Xaag is selected from Gly and Val; 

fused to the Fc portion of an immunoglobulin comprising the sequence of SEQ ID 
NO:7 

Ala-Glu-Ser-Lys-Tyr-Gly-PYo-Pro-Cys-Pro-Pro-Cys-Pro-Ala-Pro- 

Xaai6-Xaai7-Xaai8-Gly-Gly-Pro-Ser-Va^ 

Lys-Asp-Thr-Leu-Met-ne-Ser-^ 
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Val-Val-Asp-Val-Ser-Gln-Glu-Asp4^ 

Tyr-Val-Asp-Gly-Val-Glu^ 

Glu-Glu-Gln-Phe-Xaago-Ser-Thr-Ty^ 

Vd-I^u-ffis-Gln-Asp-Trp-I^u-Asn-Gly-Lys-Glu-Tyr-Lys-Cys-Lys- 
Val-Ser-Asn-Lys-Gly-Ixu-Pro-Ser-Se^ 

Lys-Ala-Lys-Gly-Gln-Pro~Arg-Glu-Pro-Gln-Val-Tyr-Thr~Leu-Pro- 

Pro-Ser-Gln-Glu-Glii-Met-Thr-Lys~Asn-Gln-Val-Ser-lxu-Thr-Cys- 

I^u-Val4.ys-Gly-Phe-Tyr-Pro^ 

Ser-Asn-Gly-Gln-Pro-Glu-Asn-Asn-Tyr-Lys/Thr- 

I^u-Asp-Ser-Asp-Gly-Ser-Phe-Phe-Ixu-Tyr-Ser-Ajg-lxu-Thr-Val- 

Asp-Lys-Ser-Arg-Trp-Gln-Glu-Gly-Asn-Val-Phe-Ser-Cys-Ser-Val- 

Met-His-Glu-Ala-Ixu-His-Asn-His-Tyr~Thr-Gln-Lys-Ser-Leu-Ser- 

Leu-Ser-Leu-Gly-Xaa23o (SEQ ID NO:7) 

wherein: 

Xaa at position 16 is Pro or Glu; 
Xaa at position 17 is Phe, Val, or Ala; 
Xaa at position 18 is Leu, Glu, or Ala; 
Xaa at position 80 is Asn or Ala; and 
Xaa at position 230 is Lys or is absent 

2. The heterologous fusion protein of Claim 1 wherein the C-terminal glycine 
residue of the GLP-1 analog is fused to the N-terminal alanine residue of the Fc 
portion via a peptide linker comprising a sequence selected from the group 
consisting of: 

a) Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser (SEQ ID 
NO:8); 

b) Gly-Ser-Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser- 
Gly-Gly-Gly-Gly-Ser (SEQ ID NO: 19); and 

c) Gly-Gly-Gly"Gly-Ser-Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser-Gly-Gly- 
Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser (SEQ ID NO:21). 

3. The heterologous fusion protein of Claim 2 wherein the linker comprises the 
sequence of SEQ ID NO: 8. 
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4. The heterologous fusion protein of any one of Claims 1 to 3 wherein Xaa at 
position 8 of the GLP-1 analog is Gly. 

5. The heterologous fusion protein of any one of Claims 1 to 3 wherein Xaa at 
position 8 of the GLP-1 analog is Val. 

6. The heterologous fusion protein of any one of Claims 1 to 3 wherein the GLP-1 
analog comprises the sequence of SEQ ID NO:l. 

7. A heterologous fusion protein selected from the group consisting of: a) Gly 8 - 
Glu 22 -Gly 36 -GLP-l(7-37)-lL-IgG4 (S228P); b) Gly 8 -Glu 22 -Gly 36 -GLP-l(7-37)- 
lL-IgG4 (S228P, F234A, L235A); c) Gly 8 -Glu 22 -Gly 36 -GLP-l(7-37)-lL-IgG4 
(S228P, N297A); d) Gly 8 -Glu 22 -Gly 36 -GLP-l(7-37)-lL-IgG4 (S228P, F234A, 
L235A, N297A); e) Gly 8 -Glu 22 -Gly 36 -GLP-l(7-37)-1.5L-IgG4 (S228P); f) Gly 8 - 
Glu 22 -GIy 36 -GLP-l(7-37)-1.5L-IgG4 (S228P, F234A, L235A); g) Gly^Glu 22 - 
Gly 36 -GLP-l(7-37)-1.5L-IgG4 (S228P, N297A); h) Gly 8 -Glu 22 -Gly 36 -GLP-1(7- 
37)-1.5L-IgG4 (S228P, F234A, L235A, N297A); i) Gly^Glu^-Gly^-GLP-ia- 
37)-2L-IgG4 (S228P); j) Gly 8 -Glu 22 -Gly 36 -GLP-l(7-37)-2L-IgG4 (S228P, F234A, 
L235A); k) Gly 8 -Glu 22 -Gly 36 -GLP-l(7-37)-2L-IgG4 (S228P, N297A); 1) Gly 8 - 
Glu 22 -Gly 36 -GLP-l(7-37)-2L-IgG4 (S228P, F234A, L235A, N297A); and the des- 
K forms thereof. 

8. A heterologous fusion protein selected from the group consisting of: a) Val 8 - 
Glu 22 -Gly 36 -GLP-l(7-37)-lL-IgG4 (S228P); b) Val 8 -Glu 22 -Gly 36 -GLP-l(7-37)- 
lL-IgG4 (S228P, F234A, L235A); c) Val^Glu^-Gly^-GLP-K?^?)-!!^^ 
(S228P, N297A); d) Val 8 -Glu 22 -Gly 36 -GLP-l(7-37>lL-IgG4 (S228P, F234A, 
L235A, N297A); e) Val 8 -Glu 22 -Gly 36 -GLP-l(7-37)-1.5L-IgG4 (S228P); f) Val 8 - 
Glu 22 -Gly 36 -GLP-l(7-37)-1.5L-IgG4 (S228P, F234A, L235A); g) Val^Glu 22 - 
Gly 36 -GLP-l(7-37)-1.5L-IgG4 (S228P, N297A); h) Val^Glu^-Gly^-GLP-ia- 
37)-1.5L-IgG4 (S228P, F234A, L235A, N297A); i) Vd^Glu^-Gly^-GLP-ia- 
37)-2L-IgG4 (S228P); j) Val 8 -Glu 22 -Gly 36 -GLP-l(7-37)-2L-IgG4 (S228P, 
F234A, L235A); k) Val 8 -Glu 22 -Gly 36 -GLP-l(7-37)-2L-IgG4 (S228P, N297A); 1) 
Val 8 -Glu 22 -Gly 36 -GLP-l(7-37)-2L-IgG4 (S228P, F234A, L235A, N297A); and 
the des-K forms thereof. 

9. A polynucleotide encoding the heterologous fusion protein of any one of Claims 1 
to 8. 
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10. A vector comprising the polynucleotide of Claim 9. 

1 1. A host cell comprising the vector of Claim 10. 

12. A host cell expressing at least one heterologous fusion protein of any one of 
Claims 1 to 8. 

5 13. The host cell of Claim 12 wherein the host cell is a CHO cell. 

14. The host cell of Claim 12 wherein the host cell is a NS0 cell. 

15. A process for producing a heterologous fusion protein comprising the steps of 
transcribing and translating a polynucleotide of Claim 9 under conditions wherein 
the heterologous fusion protein is expressed in detectable amounts. 

10 16. A method of treating a patient with non-insulin dependent diabetes mellitus 

comprising the administration of a therapeutically effective amount of the 
heterologous fusion protein of any one of Claims 1 to 8. 

17. A method of inducing weight loss in an overweight patient comprising the 
administrations of a therapeutically effective amount of the heterologous fusion 

15 protein of any one of Claims 1 to 8. 

18. The method of Claim 16 or 17 wherein the heterologous fusion protein is 
administered at a dose between about 0.05 mg/kg to 0.5 mg/kg body weight. 

19. The method of Claim 16 or 17 wherein the heterologous fusion protein is 
administered once a week. 

20 20. Use of the heterologous fusion protein of any one of Claims 1 to 8 for use as a 
medicament. 

21. Use of the heterologous fusion protein of any one of Claims 1 to 8 for the 
manufacture of a medicament to treat non-insulin dependent diabetes mellitus. 

22. Use of the heterologous fusion protein of any one of Claims 1 to 8 for the 
25 manufacture of a medicament to treat obesity or induce weight loss in an 

overweight subject. 

23. Use of the heterolgous fusion protein of any one of Claims 1 to 8 for the treatment 
of a human or animal body by therapy. 
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24. A heterologous fusion protein comprising a GLP-1 analog comprising a sequence 
selected from the group consisting of: 

a) (SEQIDNO:l) 

His-Xaa8-Glu-Gly-Thr^Pte 

Gln-Ala-Ala-Lys-Glu-Phe-ne-Ala-Trp-Leu-Val-Lys-Gly^Gly-Gly 
wherein Xaag is selected from Gly and Val; 

b) (SEQ ID NO:2) 

ffis-Xaag-Glu-Gly-Thr-Phe-Thr^^ 

Gln-Ala-Ala-Lys-Glu-Phe-ne-Ala-Trp-Leu-Lys-Asn-Gly-Gly-Gly 
wherein Xaag is selected from Gly and Val; 

c) (SEQIDNO:3) 

His-Xaag-Glu-Gly-Thr-Phe-Thr-Ser-As^ 
Gln-Ala-Ala-Lys-Glu-Phe-ne-Ala-Tip-Leu-Val-Lys-Gly--Gly-Pro 
wherein Xaag is selected from Gly and Val; 

d) (SEQEDNO:4) 

ffis-Xaa 8 -Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Glu- 
Gin- Ala-Ala-Lys-Glu-Phe-Ile-Ala^Trp-Leu-Lys-Asn-Gly-Gly-Pro 
wherein Xaag is selected from Gly and Val; 

e) (SEQ ID NO:5) 

His-Xaag-Glu-Gly-Thr-Phe-Thr^ 

Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-Gly-Gly 
wherein Xaag is selected from Gly and Val; 

f) (SEQDDNO:6) 

His-Xaas-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Glu- 
Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Lys-Asn-Gly-Gly 
wherein Xaag is selected from Gly and Val; 

fused to the Fc portion of an immunoglobulin comprising the sequence of SEQ ID 
NO:7 

Alai-Glu-Ser-Lys-Tyr-Gly-Pro-Pro-Cys-Pro-Pro-Cys-Pro-Ala-Pro- 
Xaa^-Xaan-Xaaig-GIy-Gly-Pro-Ser-Val-Phe-Leu-Phe-Pro-Pro-Lys-Pro- 
Lys-Asp-Thr-l^u-Met-De-Ser-Arg-Thr-Pro-Glu-Val-Thr-Cys-Val- 
Val-Val-Asp-Val-Ser-Gln-Glu-Asp-Pro-Glu-Val-Gto-Phe-Asn-Trp- 
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Tyr-Val-Asp-Gly-Val-Glu-Val-His-Asn^ 
Glu-Glu-Gln-Phe-Xaago-Ser-Thr-Ty^ 

Val-l^u-His-Gln-Asp-Tip-I^u-Asn-Gly-Lys-Glu-Tyr-Lys-Cys-Lys- 

Vd-Ser-Asn-Lys-Gly-I^u-Pro-Ser-Ser-ne-Glu-Lys-Thr-Ile-Ser- 

Lys-Ala-Lys-Gly-Gln-Pro-Arg-Gto^^ 

Pro-Ser-Gln-Glu-Glu-Met-Thr-^ 

I^u-Val-Lys-Gly-Phe-T^ 

Ser-Asn-Gly-Gln-Pro-Glu-Asn-Asn-Tyr-Ly^ 

Leu-Asp-Ser-Asp-Gly-Ser-Phe-Phe-I^u-^ 

Asp-Lys-Ser-Arg-Tip-Gln-Glu-Gly-Asn-V^ 

Met-His-Glu-Ala-L^u-His-Asn-His-T^ 

Leu-Ser-Leu-Gly-Xaa23o (SEQ ID NO:7) 

wherein: 

Ala at position 1 is absent; 
Xaa at position 16 is Pro or Glu; 
Xaa at position 17 is Phe, Val, or Ala; 
Xaa at position 18 is Leu, Glu, or Ala; 
Xaa at position 80 is Asn or Ala; and 
Xaa at position 230 is Lys or is absent 
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SEQUENCE LISTING 

<UO> Eli Lilly and company 

<120> glp-1 Analog Fusion Proteins 

<130> X-15984 

<150> 60/477880 
<151> 2003-06-12 

<160> 21 

<170> Patentln version 3.3 

<210> 1 

<211> 31 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<220> 

<221> MISC_FEATURE 

<222> (2).. (2) m m 

<223> Xaa at position 2 is Gly or val 

<400> 1 

His xaa Glu Gly Thr Phe Thr ser Asp Val Ser Ser Tyr Leu Glu Glu 
15 10 15 

Gin Ala Ala Lys Glu Phe lie Ala Trp Leu Val Lys Gly Gly Gly 
20 25 30 

<210> 2 
<211> 31 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic construct 
<220> 

<221> MISC__FEATURE 

<222> C2)..(2) . . 

<223> xaa at position 2 is Gly or val 

<400> 2 

His xaa Glu Gly Thr Phe Thr ser Asp val ser ser Tyr Leu Glu Glu 
15 10 15 

Gin Ala Ala Lys Glu Phe lie Ala Trp Leu Lys Asn Gly Gly Gly 
20 25 30 

<210> 3 
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<211> 31 
<212> PRT 
<213> Artificial 

<220> 

<223> synthetic construct 
<220> 

<221> MISC.FEATURE 

<222> (2).. (2) 

<223> Xaa at position 2 is Gly or val 

<400> 3 

His xaa Glu Gly Thr Phe Thr ser Asp Val ser ser Tyr Leu Glu Glu 
1 5 10 15 

Gin Ala Ala Lys Glu Phe lie Ala Trp Leu val Lys Gly Gly Pro 
20 25 30 

<210> 4 
<211> 31 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic Construct 
<220> 

<221> MISCL.FEATURE 

<222> (2) . . (2) 

<223> xaa at position 2 is Gly or Val 

<400> 4 

His xaa Glu Gly Thr Phe Thr Ser Asp val ser ser Tyr Leu Glu Glu 
1 5 10 15 

Gin Ala Ala Lys Glu Phe lie Ala Trp Leu Lys Asn Gly Gly Pro 
20 25 30 

<210> 5 
<211> 30 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic Construct 
<220> 

<221> MISC_FEATURE 
<222> (2) . . (2) 

<223> Xaa at position 2 is Gly or val 
<400> 5 

His Xaa Glu Gly Thr Phe Thr ser Asp Val Ser ser Tyr Leu Glu <5lu 
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15 10 15 



Gin Ala Ala Lys Glu Phe lie Ala Trp Leu val Lys Gly Gly 
20 25 30 



<210> 6 

<211> 30 

<212> PRT 

<213> Artificial 

<220> 

<223> synthetic Construct 



<220> 

<221> MISCFEATURE 
<222> (2).. (2) 

<223> Xaa at position 2 is Gly or val 
<400> 6 

His xaa Glu Gly Thr Phe Thr ser Asp val Ser Ser Tyr Leu Glu Glu 
15 10 15 



Gin Ala Ala Lys Glu Phe He Ala Trp Leu Lys Asn Gly Gly 
20 25 30 



<210> 7 

<211> 230 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 



<220> 

<221> MISC_FEATURE 

<222> (16) . . (16) 

<223> Xaa at position 16 is Pro or Glu 
<220> 

<221> MISCFEATURE 

<222> (17) . . (17) 

<223> Xaa at position 17 is Phe, Val, or Ala 
<220> 

<221> MI SC_ FEATURE 

<222> (18) . . (18) 

<223> Xaa at position 18 is Leu, Glu, or Ala 
<220> 

<221> MIS COFEATURE 

<222> (80).. (80) 

<223> Xaa at position 80 is Asn or Ala 
<220> 

<221> MI SC— FEATURE 

<222> (230).. (230) 

<223> Xaa at position 230 is Lys or is absent 
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<400> 7 

Ala Glu ser Lys Tyr Gly Pro Pro cys Pro Pro cys Pro Ala Pro xaa 
15 10 15 

Xaa xaa Gly Gly Pro Ser val Phe Leu Phe Pro pro Lys Pro Lys Asp 
20 25 30 

Thr Leu Met He Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp 
35 40 45 

Val ser Gin Glu Asp Pro Glu val Gin Phe Asn Trp Tyr Val Asp Gly 
50 55 60 

Val Glu val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin Phe Xaa 
65 70 75 80 

Ser Thr Tyr Arg val Val Ser Val Leu Thr val Leu His Gin Asp Trp 
85 90 95 

Leu Asn Gly Lys Glu Tyr Lys cys Lys val ser Asn Lys Gly Leu Pro 
100 105 110 

Ser Ser lie Glu Lys Thr He Ser Lys Ala Lys Gly Gin Pro Arg Glu 
115 120 125 

Pro Gin val Tyr Thr Leu Pro Pro Ser Gin Glu Glu Met Thr Lys Asn 
130 135 140 

Gin Val Ser Leu Thr Cys Leu val Lys Gly Phe Tyr Pro Ser Asp lie 
145 150 155 160 

Ala val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr Lys Thr 
165 170 175 

Thr pro Pro Val Leu Asp ser Asp Gly ser Phe Phe Leu Tyr ser Arg 
180 185 190 

Leu Thr Val Asp Lys Ser Arg Trp Gin Glu Gly Asn Val Phe ser Cys 
195 200 205 

Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys Ser Leu 
210 215 220 

Ser Leu Ser Leu Gly Xaa 
225 230 

<210> 8 
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<211> 15 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 8 

Gly Gly Gly Gly ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
1 5 10 15 

<210> 9 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 9 

His Ala Glu Gly Thr Phe Thr ser Asp val Ser Ser Tyr Leu Glu Gly 
1 5 10 is 

Gin Ala Ala Lys Glu Phe lie Ala Trp Leu val Lys Gly Arq Glv 
20 25 30 

<210> 10 
<211> 71 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 10 

His Gly Glu Gly Thr Phe Thr ser Asp val ser Ser Tyr Leu Glu Glu 
15 10 is 

Gin Ala Ala Lys Glu Phe lie Ala Trp Leu Val Lys Gly Arq Gly Gly 
20 25 30 

Gly Gly Gly Ser Gly Gly Gly Gly ser Gly Gly Gly Gly Ser Gly Gly 
35 40 45 

Gly Gly ser Gly Gly Gly Gly ser Gly Gly Gly Gly Ser Ala Glu Ser 
50 55 60 

Lys Tyr Gly Pro Pro Cys Pro 
65 70 

<210> 11 

<211> 9 

<212> PRT 

<213> Artificial 

<220> 
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<223> Synthetic Construct 
<400> 11 

Trp Leu val Lys Gly Arg Gly Gly Gly 
1 5 



<210> 12 

<211> 7 

<212> prt 

<213> Artificial 

<220> 

<223> Synthetic construct 

<400> 12 

Trp Leu val Lys Gly Gly Gly 



<210> 13 

<211> 7 

<212> PRT 

<213> Artificial 

<220> 

<223> synthetic construct 

<400> 13 

Trp Leu Lys Asn Gly Gly Gly 



<210> 14 

<211> 7 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 14 

Trp Leu val Lys Gly Gly Pro 



<210> 15 

<211> 7 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic construct 

<400> 15 

Trp Leu Lys Asn Gly Gly Pro 
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<210> 16 
<211> 6 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic construct 
<400> 16 

Trp Leu val Lys Gly Gly 
1 5 

<210> 17 

<211> 6 

<212> PRT 

<213> Artificial 

<220> 

<223> synthetic construct 
<400> 17 

Trp Leu Lys Asn Gly Gly 

1 5 

<210> 18 

<211> 6 

<212> PRT 

<213> Homo sapiens 

<400> 18 

Pro Pro cys Pro Ser Cys 
1 5 

<210> 19 

<211> 22 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic construct 
<400> 19 

Gly ser Gly Gly Gly Gly Ser Gly Gly Gly Gly ser Gly Gly Gly Gly 
1 5 10 15 

Ser Gly Gly Gly Gly Ser 
20 

<210> 20 

<211> 825 

<212> DNA 

<213> Homo sapiens 

<400> 20 

cacggcgagg gcaccttcac ctccgacgtg tcctcctatc tcgaggagca ggccgccaag 60 
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gaattcatcg 


cctggctggt 


gaagggcggc 


ggcggtggtg 


gtggctccgg aggcggcggc 


120 


tctggtggcg 


gtggcagcgc 


tgagtccaaa 


tatggtcccc 


catgcccacc ctgcccagca 


180 


cctgaggccg 


ccgggggacc 


atcagtcttc 


ctgttccccc 


caaaacccaa ggacactctc 


240 


atgatctccc 


ggacccctga 


ggtcacgtgc 


gtggtggtgg 


acgtgagcca ggaagacccc 


300 


gaggtccagt 


tcaactggta 


cgtggatggc 


gtggaggtgc 


ataatgccaa gacaaagccg 


360 


cgggaggagc 


agttcaacag 


cacgtaccgt 


gtggtcagcg 


tcctcaccgt cctgcaccag 


420 


aactaactaa 


acaacaaaaa 


atacaaatac 


aaaatctcca 


araaaoacrt ccccftcctcc 


480 


atcgagaaaa 


ccatctccaa 


agccaaaggg 


cagccccgag 


agccacaggt gtacaccctg 


540 


cccccatccc 


aggaggagat 


gaccaagaac 


caggtcagcc 


tgacctgcct ggtcaaaggc 


600 


ttctacccca 


gcgacatcgc 


cgtggagtgg 


gaaagcaatg 


ggcagccgga gaacaactac 


660 


aagaccacgc 


ctcccgtgct 


ggactccgac 


ggctccttct 


tcctctacag caggctaacc 


720 


gtggacaaga 


gcaggtggca 


ggaggggaat 


gtcttctcat 


gctccgtgat gcatgaggct 


780 


ctgcacaacc 


actacacaca 


gaagagcctc 


tccctgtctc 


tgggt 


825 



<210> 21 

<211> 30 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic construct 

<400> 21 

Gly Gly Gly Gly Ser Gly Gly Gly Gly ser Gly Gly Gly Gly Ser Gly 
1 5 10 15 



Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
20 25 30 
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